Deduplicate columns to select in add_keep_cols #88

Merged

Conversation

praateekmahajan
Contributor

As of cudf 24.10, selecting duplicate columns is no longer allowed; see https://github.com/rapidsai/cudf/pull/16514/files

In our Op.add_keep_cols we now use list(dict.fromkeys(...)) to deduplicate the selected columns. Previously, the for loop added the keep_cols columns to output, and we then concatenated keep_cols + output.columns. Since output.columns already contained the keep_cols from that loop, the concatenated list had duplicates.
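A minimal sketch of the deduplication idiom described above, using plain Python lists rather than the actual Op.add_keep_cols code. The helper name `dedupe_preserving_order` and the column names are hypothetical, chosen only for illustration.

```python
def dedupe_preserving_order(columns):
    """Drop repeated column names while keeping first-seen order.

    dict.fromkeys builds a dict whose keys are the items of `columns`;
    dicts preserve insertion order (Python 3.7+), so converting the keys
    back to a list removes duplicates without reordering.
    """
    return list(dict.fromkeys(columns))


# Hypothetical column names: `output_columns` already contains the
# keep_cols entries, mirroring the situation described in this PR.
keep_cols = ["id", "label"]
output_columns = ["id", "label", "embedding"]

# Naive concatenation repeats "id" and "label".
concatenated = keep_cols + output_columns

# Deduplicated selection list.
selected = dedupe_preserving_order(concatenated)
print(selected)  # ['id', 'label', 'embedding']
```

Unlike `list(set(...))`, this keeps the columns in their original order, which matters when the list is used to select columns from a dataframe.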

Signed-off-by: Praateek Mahajan <[email protected]>
Member

@VibhuJawa VibhuJawa left a comment

Minor nit, otherwise looks good to me.

praateekmahajan and others added 2 commits September 25, 2024 15:20
Signed-off-by: Praateek Mahajan <[email protected]>
@praateekmahajan praateekmahajan merged commit 7d4bccd into rapidsai:main Sep 26, 2024
10 checks passed